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When  we  process  language  --  either  in  written  or  m  spoken  form  -  we  construct 
representations  of  what  we  are  processing  at  many  different  levels.  This  process  is 
profoundly  affected  by  contextual  information  For  example,  in  reading,  we  perceive 
letters  better  when  they  occur  in  words.  We  recognize  words  better  when  they  occur  m 
sentences  We  interpret  the  meanings  of  words  m  accordance  with  the  contexts  they 

occur  in  We  assign  grammatical  structures  to  sentences,  based  on  the  thematic 

constraints  among  the  constituents  of  the  sentences.  Many  authors  --  Huey  (1908/1968) 
Neisser  (1967)  and  Rumeihart  (1977).  to  name  a  few  --  has  amply  documented  some  or 
all  of  these  points 

Clearly  this  use  of  contextual  information  is  based  on  what  we  know  about  our 

language  and  about  the  world  we  use  language  to  tell  each  other  about.  How  does 
this  knowledge  enter  into  language  processing'^  How  does  it  allow  contextual  factors  to 
influence  the  course  of  processing^ 

In  this  paper.  I  will  describe  a  set  of  theoretical  principles  about  the  nature  of  the 

mechanisms  of  language  processing  that  provides  one  possible  set  of  answers  to  these 

questions  These  principles  combine  to  form  a  framework  which  I  will  call  the  interactive 
activation  framework.  The  paper  has  three  main  parts,  in  the  first  part,  l  will  describe 
the  principles  and  explore  a  central  reason  why  they  offer  an  appealing  account  of  the 
role  of  knowledge  in  language  processing  in  the  second  part  l  will  considei  two 

prominent  lines  of  empirical  investigation  that  have  been  offered  as  evidence  against  the 
view  that  particular  parts  of  the  processing  system  are  influenced  by  multiple  sources  of 
information,  as  the  interactive  acfivafion  framework  assumes  Finally,  in  the  third  part.  I 
will  discuss  one  way  in  which  interactive  processing  might  distinguish  itself  empirically 
from  mechanisms  that  employ  a  one-way  flow  of  information. 

To  summarize  the  main  points  of  each  part: 

•  In  the  interactive  activation  framework,  the  knowledge  that  guides  processing 
is  stored  in  the  connections  between  units  on  the  same  and  adjacent  levels. 

The  processing  units  they  connect  may  receive  input  from  a  number  of 

different  sources.  This  allows  the  knowledge  that  guides  processing  to  be 
completely  local,  while  at  the  same  time  allowing  the  results  of  processing  at 
one  level  to  influence  processing  at  other  levels,  both  above  and  below 
Thus,  the  approach  combines  a  desirable  computational  characteristic  of  an 
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encapsuiafionist  position  (Fodor.  1983)  while  retaining  the  capacity  to  exploit 
the  benefits  of  interactive  processing 

•  Two  sources  of  empirical  evidence  that  have  been  taken  as  counting  against 
interactionism  do  not  stand  up  to  scrutiny  The  first  case  is  the  resolution  of 
lexical  ambiguity  in  context.  Here  l  re-examine  existing  data  and  compare 
them  with  simulation  results  illustrating  general  characteristics  of  interactive 
activation  mechanisms  to  show  that  the  findings  are  completely  consistent 
with  an  interactive  position.  The  second  case  considered  is  the  role  of 
semantic  constraints  m  the  resolution  of  syntactic  ambiguities  Here  l  review 
some  recent  data  that  demonstrates  the  importance  of  semantic  factors  in 
phenomena  that  had  been  taken  as  evidence  of  a  syntactic  processing 
strategy  that  is  impervious  to  semantic  influences.  In  both  cases  I  will  argue 
that  the  evidence  is  just  what  would  be  expected  on  an  interactive  activation 
account 

•  It  's  an  important  and  challenging  task  to  find  experimental  tests  that  can 
distinguish  between  an  interactive  system  and  one  in  which  information  flows 
only  in  one  direction  Unidirectional  and  mteractiomst  models  can  make 
Identical  predictions  for  a  large  number  of  experiments,  as  long  as  it  is 
assumed  that  lower  levels  are  free  to  pass  on  ambiguities  they  cannot 
resolve  to  higher  levels.  However,  experimental  tests  can  be  constructed 
using  higher-level  influences  to  trigger  effects  assumed  to  be  based  on 
processing  at  lower  levels.  I  will  illustrate  this  method  by  describing  a  recent 
experiment  that  uses  it  to  provide  evidence  of  lexical  effects  on  phonetic 
processing,  and  I  will  suggest  that  this  method  may  help  us  examine  higher 
level  influences  on  lower  levels  of  processing  in  other  cases,  as  well. 


The  Interactive  Activation  Framework 

The  following  principles  characterize  the  interactive  activation  framework.  These 
principles  have  emerged  from  work  with  the  interactive  activation  model  of  visual  word 
recognition  (McClelland  and  Rumelhart.  1981;  Rumelhart  and  McClelland.  1982).  the 
TRACE  model  of  speech  perception  (Elman  and  McClelland.  1986:  McClelland  and 
Elman.  1986)  and  the  programmable  blackboard  model  of  reading  (McClelland.  1985; 
1986)  The  principles  apply.  I  believe,  to  the  processing  of  both  spoken  and  written 
language,  as  well  as  to  the  processing  of  other  kinds  of  perceptual  inputs;  however  all 
the  examples  I  will  use  here  are  taken  from  language  processing. 

•  The  processing  system  is  organized  into  levels  This  principle  is  shared  by 
virtually  all  models  of  language  processing  Exactly  what  the  levels  are.  of 
course,  is  far  from  clear,  but  this  is  not  our  present  concern.  For  present 
purposes.  i  will  adopt  an  illustrative  set  of  levels  to  provide  a  context  in 
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which  to  discuss  the  processing  interactions  that  may  be  involved  in  reading 
a  sentence  These  levels  are  a  visual  feature  level,  a  letter  level,  a  word 

level  a  syntactic  level  a  word-sense  level,  and  a  scenario  level,  on  which 
the  representation  captures  the  non-iinguistic  state  or  action  described  by  the 
sentence  being  processed.  Higher  levels  are  of  course  required  for  longer 
passages  of  text,  but  the  set  of  levels  will  provide  a  sufficient  basis  for  the 
phenomena  we  will  consider  here  For  processing  speech,  we  also  need  a 
phonetic  level  and  an  auditory  feature  level  to  provide  input  to  the 

phonological  level 

•  The  representation  constructed  at  each  .'eve/  is  a  pattern  of  activation  over  an 

ensemble  of  simple  processing  units  This  assumption  is  central  to  the  entire 
interactive  activation  approach,  and  strongly  differentiates  it  from  other 

approaches  in  this  approach,  representations  are  active  ■■  they  can 
influence,  and  be  influenced  by.  representations  at  other  levels  of  processing 
In  this  paper  I  will  adopt  the  formal  convenience  of  assuming  that  individual 
processing  units  stand  for  individual  conceptual  objects  such  as  letters 
words  phonemes  or  syntactic  attachments.  Thus,  a  representation  of  a 
spoken  word  at  the  phonetic  level  is  a  pattern  of  activation  over  units  that 

stand  for  phonemes:  these  units  are  role  specific,  so  that  the  pattern  of 
activation  of  "cat"  is  different  from  the  pattern  of  activation  of  "tac" 

•  Activation  occurs  through  processing  interactions  that  are  are  bi-directional  both 

within  levels  and  Oefween  levels  A  basic  assumption  of  the  framework  is  that 
processing  interactions  are  always  reciprocal;  it  is  this  bi-direciionai 

characteristic  that  makes  the  system  interactive.  Bi-directional  excitatory 
interactions  between  levels  allow  mutual  simultaneous  constraint  among 
adjacent  levels  and  bi-directional  inhibitory  interactions  within  a  level  allow  for 
competition  among  mutually  incompatible  interpretations  of  a  portion  of  an 
input  The  between-level  excitatory  interactions  are  captured  in  these  models 
in  two-way  excitatory  connections  between  mutually  compatible  processing 
units:  thus  the  unit  for  word-initial  IV  has  an  excitatory  connection  to  the  unit 
for  the  word  /tac/.  and  receives  an  excitatory  connection  from  the  unit  for  the 
word  /tac/ 

•  Between-level  processing  interactions  occur  between  adjacent  levels  only  This 
assumption  is  actually  rather  a  vague  one.  since  adjacency  itself  is  a  matter 
of  assumption.  l  mention  it  because  it  restricts  the  direct  processing 
interactions  to  a  reasonably  small  and  manageable  set.  rather  than  allowing 
everything  to  directly  influence  everything  else  One  possible  set  of 
interactions  between  levels  is  sketched  in  Figure  1.  Note  that  even  though 
some  pairs  of  levels  are  not  directly  connected,  each  level  can  influence 
each  other  level  indirectly,  via  indirect  connections. 


•  Between-level  interactions  are  excitatory  only,  within  level  interactions  are 
competitive  A  feature  of  the  interactive  activation  framework  that  has 
gradually  emerged  over  the  years  is  the  idea  that  between-level  interactions 
should  be  excitatory  only,  so  that  a  pattern  of  activation  on  one  level  will 


-iAr.RE  -£-E_ 


Figure  1:  A  set  Of  Dossibie  orocessmg  levels  and  connections  among  tnese  levels  m 
an  interactive  activation  model  eacn  level  would  consist  of  a  large  number 
of  simoie  orocessmg  units  No  claim  is  made  mat  tnis  is  exactly  the  ngnt 
S9'  of  'eveis  this  set  is  given  for  illustrative  ourposes  only  Bi-direct'cnai 
e<0'tatory  connections  are  recresented  by  doubied-headed  arrows  be'.'.ee^ 
'le'ghooring  levels  inhibitory  »vi.thin-ievei  connections  are  represented  bv  mo 
'mes  ending  m  dots  that  loop  back  onto  each  level 

•end  to  excite  comoatibie  patterns  at  adiacent  levels,  but  will  not  directly 
mnibit  incompatible  patterns  The  inhibition  of  incompatible  patterns  is 
assumed  to  occur  via  competition  among  alternative  patterns  of  activation  on 
the  same  level  This  idea  is  characteristic  of  assumptions  made  by 
Grossberg  ii9T6  and  elsewhere),  and  its  utility  has  become  clearer  m  later 
versions  of  interactive  activation  models  i  McClelland  and  Elman.  1986 
McClelland  1985)  The  principle  reason  for  this  assumption  is  that  it  allows 
possible  alternative  representations  to  accumulate  support  from  a  numpor  pf 
sources  then  to  compete  with  other  alternative  possibilities  so  mat  me  one 
.Mth  tne  most  support  can  dominate  ail  the  others  This  allows  me  networy 
•Q  imoiement  a  best  match'  strategy  of  choosmg  representations,  for 
example,  a  sequence  of  phonemes  that  does  not  exactly  match  any  particular 
word  .Mil  nevertheless  activate  the  closest  word  Thus  "parageet"  for 
example  can  result  m  me  recognition  of  the  .vord  parakeet  even  though  n 
does  not  match  parakeet  exactly 


Acf.'vaf'ons  and  connections  are  continuously  graded  The  activation  of  a 
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representation  is  a  matter  of  degree,  as  is  the  strength  of  the  influence  one 
representation  exerts  on  another  Degree  of  activation  of  a  unit  reflects  the 
strength  of  the  hypothesis  that  the  representational  object  the  unit  stands  for 
is  present:  the  strengths  of  the  connections  between  units  reflect  the 
strengths  of  the  contingencies  that  hold  between  the  representational  objects 

•  The  activation  process  is  non-lmear  Each  processing  unit  in  an  interactive 

activation  network  performs  a  very  simple  computation  it  adds  up  all  of  the 
weighted  excitatory  influences  it  receives  from  other  units  and  subtracts  from 
these  the  weighted  inhibitory  influences  that  it  receives  from  competing  units 
Then,  it  updates  is  activation  to  reflect  this  combined  (what  I  will  call  net) 
input.  The  activation  of  the  unit  is  monotonically,  but  not  linearly,  related  to 
this  sum;  at  high  levels  of  excitatory  input,  activation  levels  off  at  a  maximum 
value,  and  with  strong  inhibitory  input,  it  levels  off  at  a  minimum  value 
Because  of  these  non-linearities,  and  because  of  the  competitive  interactions 
among  units,  inputs  that  are  sometimes  crucial  for  determining  the  outcome 
of  processing  may  have  little  or  no  effect  at  other  times'  The  specific 
details  of  the  non-lmear  activation  assumptions  that  l  have  used  are  based 
on.  though  not  identical  with,  those  used  by  Grossberg  (eg.  Grossberg. 

1978), 

•  Activation  builds  up  and  decays  over  time,  it  is  assumed  that  processing 

interactions  occur  continually,  but  that  the  activation  process  is  gradual  and 
incremental,  so  that  it  takes  time  for  activation  to  propagate  through  the 
system.  New  inputs  begin  to  have  their  effects  immediately,  but  these 

effects  build  up  over  time  and  then  gradually  decay  away  as  processing 

continues. 

These  assumptions  are  now  being  applied  in  the  construction  of  models  of  higher- 
level  aspects  of  language  processing,  such  as  the  assignment  of  constituents  of 
sentences  to  semantic  roles  and  disambiguation  of  word  meaning  in  context  (Cottrell. 
1985.  Waltz  and  Pollack.  1985:  Kawamoto.  1985:  McClelland  and  Kawamoto.  1986).  At 
higher  levels  of  processing,  I  and  other  researchers  have  tended  to  build  models  that 
make  explicit  use  of  distributed  representation,  in  which  a  conceptual  object  is 
represented  by  a  pattern  of  activation,  rather  than  a  single  unit  (Hinton,  McClelland,  and 
Rumelhart.  1986).  However,  even  here  it  is  convenient  to  speak  of  whole  patterns  of 
activation  as  though  they  were  separate  information-processing  constructs,  that  interact 


T;  IS  ^'orih  noting  that  this  non. linear  characteristic  is  absolutely  essential  lo  the  ooeration  oi  'he  nei'.-'cif 
as  a  whole;  it  all  units  m  the  system  behaved  linearly,  no  purpose  I'Ouid  be  served  bv  having  miiiiipie 
'eveis  and  none  but  the  most  trivial  of  computational  operations  could  be  periormed.  Furthermore, 
'eedbacif  from  higher  levels  to  lower  levels  can  'ead  to  runaway  activation  m  a  linear  system  For 
d'sc'jss’on  see  Fumeihart,  Hmton.  and  McClelland  i'986t 
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.vith  each  other  via  excitatory  and  inhibitory  contingencies  indeed  the  distributed 
reotesentation  can  be  seen  as  an  implementation  of  the  more  abstract  functional 
description  isee  Smolensky,  1986  for  a  discussion  of  this  issue) 

Encapsulated  Knowledge,  Interactive  Processing 

in  his  book  on  Modularity,  Fodor  (1903)  explains  a  virtue  of  dividing  up  the 
knowledge  that  is  used,  and  encapsulating  portions  of  it  in  separate  modules  each 
dedicated  to  a  specific  part  of  a  complex  information  processing  task  Encapsulation  of 

knowledge  allows  he  notes,  for  automatized,  reflex-like  processing  in  each  module,  since 

each  module  need  only  consult  a  finite  store  or  localiy-relevant  information 

The  interactive  activation  framework  adheres  to  this  desirable  property  A  central 
feature  of  the  framework  is  the  fact  that  the  knowledge  that  guides  processing  is 
intrinsically  local  and  inaccessible  to  other  portions  of  the  network.  To  see  this  it  is 
useful  to  focus  attention  on  the  connections  between  some  pair  of  adjacent  levels  in  the 
system  for  example,  the  connections  from  the  letter  level  to  the  word  level.  These 
connections  are  the  knowledge  that  allows  ihe  system  to  form  appropriate  word  level 

representations  from  patterns  of  activation  at  the  letter  level.  They  express  contingencies 

between  activations  of  units  at  the  letter  level,  and  activations  of  units  at  the  word  level 
This  information  is  Completely  encapsulated  within  this  part  of  the  processing  mechanism 
It  is  never  consulted  by  any  other  part  of  the  mechanism  By  the  same  token,  this  part 
of  the  mechanism  never  consults  the  knowledge  stored  in  any  other  part  in  doing  its 
lOb  which  is  simply  to  supply  input  to  the  units  at  the  word  level  We  have  then,  a 
system  in  which  the  knowledge  is  completely  encapsulated 

At  the  same  time,  the  architecture  of  the  system  overcomes  what  I  believe  is  an 
unnecessary  limitation  that  Fodor  places  on  modular  systems;  that  is  that  the  output  of  a 
module  be  independent  of  influences  from  other  sources  interactive  activation  provides 

a  framework  for  processing  in  which  multiple  sources  of  information  can  influence  the 

construction  of  representations  at  each  level.  This  is  because  each  level  combines 
inputs  it  receives  from  multiple  sources  in  determining  what  its  pattern  of  activation  shall 

be  The  input  a  level  receives  from  a  particular  adjacent  level,  then,  simply  constitutes 

one  source  of  constraint  on  the  construction  of  a  representation  that  is  subject  to 
influence  by  other  sources 
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Where  Fodor  s  analysis  went  astray  l  believe,  is  m  assuming  that  the  combined 
use  of  constraints  from  multiple  sources  requires  each  module  in  the  system  to  have 
access  to  knowledge  of  many  different  types  What  the  interactive  activation  framework 
makes  clear  is  that  this  is  not  the  case  Each  processing  level  -  each  set  of  units  - 
provides  a  device  that  performs  a  very  general  computation  that  allows  it  to  combine 
inputs  from  a  number  of  sources.  This  general  computational  characteristic  of  interactive 
activation  mechanisms  provides  a  simple  way  knowledge  at  all  different  levels  to  exert 
simultaneous  influence  on  the  outcome  of  processing,  without  requiring  any  part  of  the 
system  to  know  very  much  at  ail^ 

An  Examination  of  the  Evidence 

No  one  doubts  that  the  ultimate  outcome  of  processing  is  sensitive  to  influences 

from  many  levels  The  psychological  literature  is  replete  with  demonstrations  of  such 
effects  But  many  researchers  have  questioned  the  view  that  the  influences  exerted  by 
higher  levels  occur  through  direct  influences  from  higher  levels  back  down  Into  lower 

levels  of  processing.  There  are  two  poles  to  this  argument  First,  the  results  of  some 
experiments  have  been  taken  as  evidence  against  an  interactive  view,  at  least  with 
respect  to  certain  aspects  of  processing  Second  it  is  often  pointed  out  that  results 
that  could  be  attributable  to  interactive  processing  might  be  explained  m  other  ways 
Fodor  (19831  makes  this  point  repeatedly. 

I  will  consider  these  two  aspects  of  the  argument  against  mteractionism  m  turn 
First  I  will  consider  'wo  cases  of  experimental  findings  that  have  been  taken  as  evidence 
against  interactionism  in  two  specific  cases.  Here  my  aim  is  to  show  that  the 

experimental  facts,  when  looked  at  closely  turn  out  to  be  perfectly  consistent  with  an 
interactive  activation  account  l  do  not  mean  to  say  that  they  cannot  be  interpreted 

without  recourse  to  interaction  between  levels  Though  the  phenomena  are  just  what  we 


I  should  Idle  "lai  suggests  other  reasons  than  co''noutationai  eff'oiencv  'or  advocaimg  autonorriv 

o'  crocessi"g  i^or  one  ihmg,  he  suggests  if  modules  are  autonomous  -t  mav  be  easier  'or  Ccgnui.e 
'ceniisis  '0  analyze  exactly  «bat  'unctions  each  module  'romouies  While  this  might  //eii  be  me  case  ’’ 
seems  jniikety  'hat  the  convenience  of  Cognitive  Scientists  entered  mto  ihe  design  of  our  ^omoutaiicnai 
machinery,  comoutaiionai  considerations  seem  more  iikeiv  to  nave  influenced  the  course  of  evolution:  and 
my  argument  ■$  mat  such  considerations  favor  mreracfiomsm 
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encecf  from  an  interactive  activation  approach  there  can  be  alternative  mte''ore'ations 
In  a  later  section  l  -.vill  turn  specifically  to  the  question  of  how  one  might  find  evidence 
that  more  clearly  favors  an  interactive  activation  view 

The  Case  Against  Interactionism 

The  two  cases  i  will  consider  both  purport  to  demonstrate  the  autonomy  of  some 
aspect  ot  processing  from  higher-level,  or  contextual  influences  One  of  these  cases 
concerns  accessing  word  meanings  The  other  concerns  the  mechanism  that  determines 
how  constituents  should  the  attached  to  each  other  m  constructing  a  representation  of 
the  syntactic  structure  of  a  sentence. 

in  examining  each  of  these  cases,  't  will  be  helpful  to  have  two  basic  properties  of 
'"teractive  activation  systems  m  view  The  first  is  that  contextual  influences  often 
produce  what  i  will  call  selective,  as  opposed  to  predictive  effects  The  second  is  that 
contextual  effects  ••  indeed,  the  effects  of  any  factor  -  can  be  masked  by  strong  effects 
of  other  factors  The  first  fact  will  be  useful  when  we  come  to  interpret  evidence  that 
context  appears  to  exert  primarily  a  selective  effect  m  certain  lexical  ambiguity  resolution 
experiments  the  second  will  be  most  relevent  when  we  examine  evidence  that  semantic 
context  effects  do  not  show  up  in  the  initial  processing  of  certain  grammatical 

constructions 

To  illustrate  the  first  point,  let  us  consider  the  recognition  of  an  ambiguous 
phoneme  embedded  in  a  context  which  should  favor  one  interpretation  over  the  other  A 
simulation  illustrating  this  is  shown  in  Figure  2  using  the  fRACE  model  of  speech 

perception  (McClelland  and  Elman,  1986). 

To  understand  the  simulation,  some  facts  about  the  model  are  necessary  The 
model  consists  of  units  grouped  into  three  processing  levels  There  is  a  phonet'c 
feature  level  a  phoneme  level,  and  a  word  level  Within  each  level  there  are  separate 
pools  of  units  for  each  small  temporal  segment  of  an  utterance  Thus  successive 
phonemes  in  a  word  activate  phoneme  detectors  in  successive  pools  of  units  it  is 
useful  to  visualize  the  feature  units  as  though  they  are  laid  out  m  successive  banks  from 

left  to  right  in  space,  with  banks  of  phoneme  units  above  them  and  banks  of  word  units 

above  the  phoneme  units  Each  bank  of  unit  covers  only  a  small  temporal  window 


/t  /  and  /d  /  in  /  dor?  / 


Figure  2:  nme  course  ot  activation  of  onus  for  d/  and  t/  at  the  end  o* 

string  dar'’  where  the  stands  for  a  segment  amoiguous  between  t  and 
d'  The  time  course  of  activafon  of  the  unit  for  the  word  carr  's  aiso 
shown  above 

Sco^en  '’Cut  s  sweet  across  this  soat'ai  array  'rom  left  to  'ignt  crov'O'pg  n^ui  'c 
‘eature  ur’its  n  successive  banks  as  time  progresses  Connections  between  ‘eatu'e  a^d 
cho''eme  units  allow  active  feature  units  m  a  carticu'ar  bac:^  to  send  e^citatcr,  ''cut  'c 
^nits  for  aoorocriate  phonemes  m  correscendmg  banks  phoneme-to-w'ord  conreC'C’S 
a'low  chenemes  to  send  excitation  to  approcriate  words  m  corresponding  banks  'no'? 
are  also  ‘eedback  connections  from  the  word  level  to  the  phoneme  level  and  frpn’  -"e 
^honomo  to  the  feature  level  in  addition  to  these  excitatory  connections  mere  a^e  a'so 
nhiciiory  connections  between  units  which  scan  overlapping  temocral  regions  'ne 

choneme  'evef  mis  'neans  that  competition  occurs  only  among  alternative 
interpretations  of  the  same  temporal  segment  of  speech 

m  our  example  we  will  consider  an  mput  that  consists  of  mo  pnonemes  a  a 
and  fpilowea  by  a  phonetic  segment  that  is  ambiguous  between  d/  and  i  '■■e 

hgure  'iiusirates  me  buiid-up  of  activation  for  the  phoneme  units  activated  by  t^e  ''pai 
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amoiguous  sound  We  can  see  that  initially,  there  is  a  very  slight  advantage  of  ’he  m 
over  the  d'  This  advantage  stays  relatively  constant  for  a  time,  but  gradually  /t/  begins 

to  dominate  /d/  and  to  push  its  activation  down  While  both  p'^nnemes  are  initially 

activated,  only  one  remains  active  in  the  end. 

Why  is  the  context  effect  so  small  at  first’’  The  primary  reason  has  to  do  with 
the  degree  of  constraint  imposed  by  the  context.  Activation  of  the  /t/  over  the  tg/ 
results  from  feedback  from  the  word  level,  but  at  the  time  the  /t/  and  /d/  are  coming  m 

the  relevant  word  detector  (for  the  word  dart)  is  not  very  active  The  reason  is  simply 

that  there  are  several  other  words  that  are  still  cohsistent  with  the  input  up  to  that 
DOint  These  words  are  all  in  competition,  so  that  none  are  very  highly  activated  The 
ambiguous  phoneme  itself  must  determine  which  of  these  words  is  really  being  said,  and 
'hereby  allow  it  to  dominate  the  possibilities  left  open  by  preceding  portions  of  the  input 
Only  after  the  ambiguous  word  strengthens  the  activation  of  dan  over  its  competitors  can 
dan  really  provide  strong  support  for  the  d/  interpretation  of  the  final  phoneme 

I  want  to  make  it  clear  that  context  can  and  does  exert  stronger  effects  than  we 
see  here  under  some  circumstances  When,  for  example  an  ambiguous  segment  comes 
at  the  end  of  a  long  word  that  has  no  remaining  competitors  a  few  phonemes  before 
fho  ambiguous  segment  is  received,  v/e  see  much  stronger  context  effects  in  the 
simulation  These  effects  are,  of  course,  consistent  with  the  empirical  finding  that  lexical 
effects  in  speech  processing  are  larger  at  later  points  in  words  (Marsien-Wilson  and 
Welsh.  1978:  Samuel.  1981). 

The  essential  point  is  that  context  that  is  clearly  strong  enough  to  exert  a  potent 
role  in  determining  the  eventual  outcome  of  processing  may  very  well  exert  its  influence 
primarily  by  selecting  among  alternatives  as  they  are  becoming  activated  bottom-up  An 
initial  slight  advantage  is  generally  observed  for  the  contextually  appropriate  alternative 
but  both  appropriate  and  inappropriate  alternatives  may  receive  considerable  activation 
before  the  resolution  of  the  ambiguity  is  complete 

Now  we  consider  the  second  point,  namely  that  effects  of  context  can  be  blocked, 
if  there  are  other  factors  that  are  exerting  stronger  influences  To  demonstrate  this  I 
will  show  the  results  of  two  more  simulation  runs  with  the  TRACE  model,  using  an 


I 


■ri!e'’3C''on  in  .anguage  “'0C‘;s3''' 


^''af^cgLC'LS  ‘'f’ai  a  cf^e  case  arid  30  ■jf'amc'gucus  ''f^ai  '  m  tee  c'^er  c'^ececed 
C'.  ’"e  5"  ''5  da'’  ‘■‘ere  ccf^text  seouid  succert  lee  t  S''''ce  cad  -s  a  .•.cd 

as  Pigure  3  seo.vs  .vhen  the  input  'S  unambiguous  d  produces  strong  bctiom-uo 
sucoon  rcr  tnp  phoneme  actually  ofesented  and  this  actually  biceps  out  the  etteci  o( 
context  almost  comcietely 
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Figure  3:  Time  course  Of  activation  of  detectors  for  the  final  n  m  .'darft  and  'hp 
final  'd/  in  /dard/  Also  shown  above  's  the  time-course  of  activation  of 
the  detector  tor  the  word  dart  m  each  case 

Though  there  is  a  slight  advantage  for  the  tf  it  is  very  small  and  might  easily  go 
undetected  m  an  experiment  Certainly  there  •$  no  doubt  that  a  ft'  will  be  heard  m 
one  case  and  a  'd/  m  the  other  The  reason  is  that  with  strong  bottom  uo  moui 
‘avonng  a  oarticuiar  interpretation,  the  correct  ans’.ver  is  Quickly  locked  mto  the  svs'em 
and  keeps  the  alternatives  from  becoming  activated,  due  to  competitive  inhibition  ampng 
units  standing  for  alternative  interpretations  at  the  same  level  The  differential  feedback 
suDDort  'hat  the  ti  receives  does  not  realty  become  strong  enough  to  influence 
crocessing  until  it  is  too  late. 


/d  /  in  /  dard  /  and  /  t  /  in  /  dart  / 


/d//a//r//?/ 


Time  Steps 


u.) 
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Again  I  want  to  make  clear  that  the  effect  of  context  would  be  stronger  m  other 
cases  When  there  is  a  strong  expectation  before  the  target  occurs,  feedback  from 

higher  levels  can  act  as  a  second  source  of  excitation  favoring  the  one  alternative 
under  these  conditions  the  contextually  favored  alternative  will  have  more  of  an 
advantage  But  in  many  cases,  a  context  that  would  be  sufficient  to  disambiguate  a 
borderline  stimulus,  as  we  saw  in  the  previous  simulation,  will  have  very  little  effect  when 
the  stimulus  is  not  borderline  as  in  the  present  case 

These  kinds  of  effects,  where  a  strong  cue  overshadows  the  effects  of  a  weak  cue 
that  is  known  to  operate  under  other  circumstances,  are  absolutely  ubiquitous  in  the 
literature  They  are  nicely  explained  by  the  interactive  activation  approach,  and  by  other 
models  such  as  the  Oden-Massaro  information  integration  model  lOden  and  Massaro 
19T81  As  just  one  example  Ganong  (19801  found  |ust  these  kinds  of  effects  m  his 

initial  studies  of  the  lexical  effect  in  phoneme  identification  He  reported  that  context 
biased  the  interpretation  of  ambiguous  sounds  at  or  near  the  boundary  between  two 

phonetic  categories,  but  did  not  alter  the  interpretation  of  unambiguous  sounds  well 
within  one  category  or  another  One  hears  the  /k/  in  (strongly  articulated)  k/fr  correctly, 
m  spite  of  (he  unfavorable  context.  Simulations  reported  in  Elman  and  McClelland 

(1986)  show  that  these  sorts  of  effects  are  expected  in  the  interactive  activation 

framework 

Given  these  preliminary  observations,  we  are  now  ready  to  consider  the  case 

against  interaction  in  lexical  access  and  in  syntactic  analysis  in  the  first  case,  the 

claim  has  been  made  that  initial  access  to  words  occurs  autonomously,  without  regard  to 
context,  and  that  higher  levels  simply  select  the  appropriate  word  from  those  that  are 
made  available  by  the  autonomous  access  mechanism  (Tanenhaus.  Leiman.  and 
Seidenberg,  1979.  Seidenberg.  Tanenhaus.  Leiman.  and  Bienkowski.  1982)  In  the 

second  case,  the  claim  is  that  the  syntactic  processing  of  a  sentence  is  encapsulated 
so  higher  levels  of  processing  only  accept  or  reject  possible  parses  presented  to  them 
by  the  syntactic  level.  I've  chosen  to  examine  these  cases  for  two  reasons.  First,  they 
are  both  often  cited  as  evidence  of  autonomy  and  so  they  are  worth  considering,  in  and 
of  themselves  Second,  they  each  illustrate  characteristics  of  the  interactive  activation 
framework  that  ought  to  be  taken  into  account  in  attempts  to  argue  against  an 
interactive  position 
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Word  Sense  Disambiguation 

There  are  now  several  studies  using  a  cross-modaiity  priming  paradigm  to  studv 

word-sense  disambiguation.  The  first  two  such  studies  were  those  of  Tanenhaus  Leiman 
and  Seidenberg  (1979)  and  of  Swinney  (1979)  In  these  and  other  studies,  the  following 
pattern  has  been  found  immediately  after  an  ambiguous  word  both  meanings  appear 
to  be  activated,  even  when  context  is  provided  which  favors  one  interpretation  of  the 

target  word  over  the  other  After  a  delay,  the  only  contextually  appropriate  meaning 

appears  to  remain  active 

This  pattern  of  results  has  been  interpreted  as  favoring  a  view  that  i  will  call  the 
autonomous  'e^'ucai  access  position  (Tanenhaus.  Leiman  and  Seidenberg  I979i 
According  to  this  position  the  process  Of  accessing  meanings  of  words  is  driven  on(v  bv 
(he  bottom-up  processing  of  the  stimulus  coniext  operates  only  later  to  select  among 
the  alternatives  that  are  made  available  by  the  bottom-up  access  process 

In  this  section.  I  will  argue  that  the  results  indicate  instead  a  pattern  that  is 
conforms  to  what  we  would  expect  from  an  interactive  activation  model  initially  both 

meanings  appear  to  be  accessed,  but  -  and  this  is  the  crucial  point  --  the  evidence 
suggests  that  the  contextually  appropriate  reading  is  m  fact  favored  over  the  contextually 
inappropriate  reading  even  early  on  in  processing 

In  documenting  this  claim.  I  will  focus  first  on  the  experiments  of  Swinney  ii979) 

He  presented  ambiguous  words  like  "bugs"  in  contexts  which  favored  one  or  the  other 
meaning  of  this  word  (insects  or  snooping  devices)  The  ambiguous  word  occurred  in  a 
spoken  passage,  and  subjects  listened  to  the  passages  through  earphones;  at  the  end  of 

the  ambiguous  word,  they  were  tested  with  a  visually  presented  probe  word  This  word 

could  be  related  to  the  contextually  appropriate  meaning  of  the  ambiguous  prime  word 
{ants),  to  the  contextually  inappropriate  meaning  {spy),  or  it  could  be  unrelated  to  the 
ambiguous  word  (sew)  The  task  was  simply  to  indicate  whether  the  visually  presented 
probe  was  a  word  or  not.  Non-word  probes  were  of  course  presented  on  other  (rials. 

The  results  of  Swinney  s  experiment  showed  faster  lexical  decision  reaction  times  to 
probes  related  to  both  meanings  of  the  ambiguous  prime  word,  relative  to  control 
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^-'ore  .'.as  a  'C  pisec  advantage  'or  v-ie  'arget  ^e'ated  to  ;ne  conte'fjaiiv  accccpa'e 
-"egn.r^g  of  •■'p  arrt'gL'Ous  prirne  and  a  50  "^sec  advantage  'or  'ne  target  'e'a'ed  'o  '^e 
contextually  ir^apcrocnate  meaning  of  the  prime  Both  .vere  significantly  faster  't^an  me 
'esponses  m  the  control  condition 

In  a  follow-up  study  Swmney  replicated  his  first  experiment  and  compared  me 
results  to  the  results  of  a  second  condition  m  .vhich  the  probe  was  delayed  bv  three 
Svi'apies  At  0  delay  the  appropriate  probe  showed  38  msec  facilitation  and  the 
■naopropnate  probe  Showed  3i  msec  After  the  delay  the  appropriate  probe  showed  -i’ 
msec  and  the  inappropriate  probe  was  1  msec  slower  than  control-  Because  the 
second  experiment  contains  all  of  the  relevant  conditions  i  have  graphed  the  results  m 
-■gure  a 


Figure  4;  inferacfipn  of  context  and  delay  m  the  cross-modal  priming  experiment  of 
Swmney.  1979 

The  basic  pattern  of  results  obtained  by  Swmney  was  also  found  bv  Tanenhaus 
Leiman.  and  Seidenberg  (1979).  hereafter  called  TLS.  and  by  Seidenberg.  Tanenhaus 
Leiman  and  Bienkowski  (1982).  hereafter  called  STLB  in  fact,  m  two  conditions  of 
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STLB  ifor  noun-noun  ambiguities  m  Experiments  2  and  4)  there  was  a  significant 
selective  cnming  effect  at  0  delay  However  In  four  other  conditions  over  the  two 
experiments,  priming  of  both  meanings  was  found  Looking  just  at  the  six  different 
experiments  finding  priming  of  both  meanings  at  0  delay  (two  of  Swinney  s.  one  from 
TLS  and  three  from  STLB)  we  find  that  in  five  of  the  six  cases,  the  contextually 
appropriate  target  receives  stronger  priming  than  the  inappropriate  one  These  findings 
are  summarized  in  Table  1  TLS  and  STLB  also  provide  confirmation  that  at  a  delay 
there  is  strong  selection  of  the  contextually  appropriate  reading:  they  used  a  delay  of 
200  msec  by  which  time  the  contextually  inappropriate  probe  word  showed  no  residual 
priming. 

Table  i 

Priming  effects  of  Ambiguous  Words  in  Context.  0  delay 


Appropriate 

Meaning 


TLS  1979 

33.5 

Swinney  1979 
Expt  1 

70 

Expt  2 

38 

STLB  1982 

Expt  3 

17.5 

Expt  4 
(noun-verb) 

16 

Expt  5 

20 

MEAN 

32.5 

Inappropriate 

Meaning 

A  >  I  ? 

22 

YES 

50 

YES 

31 

YES 

13.5 

YES 

28 

NO 

15 

YES 

26.5 

5  out  of  6 

While  the  fact  that  both  meanings  are  initially  primed  is  consistent  with  an 
autonomy  position,  this  result  is  also  completely  consistent  with  an  interactive  account 
Based  on  our  earlier  simulation  with  the  ambiguous  /d/-/i/  stimulus,  this  is  just  what  we 
expect  to  see  Of  course,  the  consistent  slight  advantage  of  contextually  appropriate 
targets  at  0  delay  is  also  what  we  expect  on  an  interactive-activation  account  Further 
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suDport  for  the  idea  that  there  is  a  context  effect  for  0-deiay  probes  is  provided  by 
some  observations  of  Simpson  (1984),  regarding  another  experiment  by  Onifer  and 
Swinney  (1981)  He  noted  that  Onifer  and  Swinney's  experiments  collected  reaction 
times  to  probes  for  each  meaning  of  an  ambiguous  word,  both  when  the  context  favored 
that  meaning  and  when  it  favored  the  alternative  meaning  He  then  compared  lexical 
decision  times  when  the  context  was  appropriate,  against  lexical  decision  times  when  the 
context  was  inappropriate,  and  found  that  decision  times  were  consistently  faster  with 
appropriate  context^ 

The  fact  that  selection  is  complete  at  a  longer  delay  is  also  fully  consistent  with 
the  activation-competition  processes  that  are  assumed  by  the  interactive  activation 
approach  indeed  the  simulation  shown  in  Figure  2  is  fully  consistent  with  the  pattern  of 
"•esuits  that  we  see  m  these  experiments 

The  initial  advantage  for  contextually  appropriate  readings  is  small  enough  that  it 
does  not  generally  show  up  as  significant.  An  interactive  approach  predicts  that  it 
should  be  possible  to  produce  relatively  strong  contextual  effects,  even  at  short  delays, 
when  the  context  exerts  relatively  strong  constraints.  The  question  arises,  then;  should 
we  have  expected  the  contexts  used  in  these  studies  to  produce  strong  effects?  In 
general  if  is  difficult  to  give  a  definitive  answer  to  this  question  since  investigators  have 


3 

I  should  meniioo  two  somewhat  countervailing  caveats  concerning  the  mteroretation  oi  data  irr-m  ihoca 
evoenments  On  the  one  hand,  the  response  to  the  probe  does  not  occur  until  several  hundred 
milliseconds  after  the  pnming  vord.  even  when  the  probe  loiiows  the  ambiguous  vnrd  with  0  deiav  rhijs 
•  hpro  ,5  room  lor  post-access  processing  of  the  ambiguous  -ord  before  'he  response  to  the  probe  'S  made, 
even  ■■ 'ih  a  0  msec  deiav:  an  autonomy  position  could  aiwavs  laire  refuge  m  such  a  possibiliiv  '0  erD'am 
a.vav  effects  of  comevt  at  0  deiav.  On  me  other  hand,  it  has  been  noted  mat  mere  mav  be  scn-ie 
bacirward  o''ming  eifects  of  me  prime  on  me  ambiguous  word  iGlucirsberg.  Kreuz.  and  Rho.  '986'.  mis 
might  have  artificiaiiv  raised  me  acuvahon  of  me  conievfuailv  inappropriate  readmg  at  0  deiav  'but  see 

Co-Oorrhorg  of  al  'QPJi 
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not  tended  to  focus  specifically  on  the  degree  of  constrainf^  The  matter  certainly 
deserves  further  scrutiny  However  mere  is  one  experiment  that  supports  the  prediction 
that  relatively  stronger  contextual  effects  will  be  found  early  in  processing  when  relatively 
strong  contexts  are  used.  An  experiment  by  Simpson  (1981)  bears  directly  on  this  point 
He  selected  a  group  of  60  ambiguous  words  and  identified  for  each  word  a  dominant 
and  a  non-dominant  meaning.  He  then  constructed  five  context  sentences  for  each 
word,  one  that  strongly  favored  the  dominant  reading,  one  that  weakly  favored  the 
dominant  reading,  one  that  was  neutral,  one  that  weakly  favored  the  subordinate  reading, 
and  one  that  strongly  favored  the  subordinate  reading.  He  presented  these  sentences  to 
subjects,  then  followed  the  final  word  with  a  probe  related  either  to  the  dominant  or  the 
subordinate  meaning  or  with  a  control,  unrelated  word  The  probe  occurred  120  msec 
after  the  offset  of  the  ambiguous  prime  word 

I  have  graphed  the  facilitation  effects  Simpson  found  m  Figure  5.  as  a  function  of 
the  strength  of  the  context  (from  strongly  favorable  to  the  meaning  related  to  the  probe 
to  strongly  unfavorable)  separately  for  the  Dominant  and  Subordinate  probes  As  the 
figure  makes  plain,  there  is  a  strong  effect  both  of  Dominance  and  of  Context,  as  well 
as  a  Context  by  Dominance  interaction.  The  interaction  is  such  that  when  the  context 
IS  strong,  it  completely  wipes  out  the  effect  of  dominance  Only  when  the  context  is 
.veak  or  neutral  's  a  strong  dominance  effect  found 

The  effects  shown  in  this  figure  are  exactly  the  kind  of  effects  we  would  expect  to 
find  from  an  interactive  activation  model.  Each  of  the  two  factors  manipulated  should 
produce  an  effect,  but  only  when  it  is  not  dominated  by  the  other  factor.  These  kinds 
of  effects  are  ubiquitous,  as  I  have  already  noted  and  are  naturally  accounted  for  by 


an  intPrarti.;p  aCT'vallOn  OOinl  ot  VIA'*'  D'ACi'C'ab'I'lv  Ir'nm  *he  C^ACAdfAg  r^nnlA’*!  "A  ‘ 

Drobab'l'lvl  Orov'dAS  a  'AaSOnablA  OOAfal'Onal  dAlmili'an  o*  dAg^AA  Cl  conslramt;  I'oni  IHa  Sinv.ilat'An  "I'A  thp 
nout  'daf’'  'I  ■vas  clear  ibat  even  -vhen  there  are  only  three  DossibiiHies  consistent  '/vith  the  br'or  coniori. 
the  coniexi  exerts  bnniarilv  a  selective,  rather  than  a  oredrcnve  eheci.  in  'his  light,  the  Dredcmma'-viv 
selective  catier’^  that  's  observed  'n  the  cross-modal  experiments  seems  consistent  with  mv  own  best  guess 
about  'ho  Dredictiveness  of  the  contexts  used  in  S*'innev  ''979’  a  S'ngie  examoie  stimulus  o'/en  n 

,vh";h  in^ro  .3  a  shcngiv  constraining  context  hiQ-.-'ever.  an  examination  c'  the  'iiii  set  o<  materials  '.ised  bv 

Oniter  and  Sr/innev  ('98')  indicates  that  in  these  later  studies,  at  'east,  there  -'as  a  ■ 'de  -ange 
contextual  constramt  ^or  examoie  consider  the  context'  'Th®  oitice  nails  "ere  so  thin  that  thev  couid 

hear  tho  II  seems  iikeiv  that  subiects  asked  to  guess  -'ouid  supoiv  a  vanetv  of  dinereni  continuations, 

v'th  nrg.  the  actual  ambiguous  "'Ord.  being  onlv  one  ot  many  DOSSibilitieS 
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Data  From  Simpson  (1981) 


Context  (Favorable  to  Unfavorable) 


Figure  5;  Effects  of  dominance  and  context  from  Simpson  I98i  Data  from  t.vo 

groups  of  subjects  are  combined  One  group  received  the  strong  and 
neutral  contexts,  and  the  other  received  the  weak  and  neutral  contexts  For 

the  neutral  condition  i  have  connected  the  points  through  the  mean 

averaged  over  the  two  groups.  The  nonzontais  at  the  top  and  bottom  of 

the  vertical  bars  represent  the  values  obtained  by  the  strong  and  weak 

context  groups,  respectively 

the  principles  of  interactive  activation  Unfortunately  there  was  a  delay  of  120  msec 

after  the  ambiguous  word  m  Simpson  s  experiment  before  the  presentation  of  the  probe 
thus  there  is  room  to  argue  that  the  strong  effects  of  context  that  he  observed  were 
due  at  least  m  part  to  this  delay.  Thus  a  definitive  test  of  the  predicted  immediate 
context  effect  with  strongly  constraining  contexts  must  await  further  research 


Thus  far  I  have  argued  from  characteristics  of  interactive  activation  mecnaniems  as 
observed  m  simulations  of  lexical  effects  on  phoneme  perception  Some  readers  miny 
wonder  whether  these  general  characteristics  of  interactive  activation  mechanisms  can 
actually  be  incorporated  m  a  working  model  of  meaning  selection  in  fact  botn  CoftreH 
1 1985)  and  Kawamoto  it 985)  have  developed  simulation  models  that  incorporate  me 
ormciples  of  interactive  activation  and  that  exhibit  effects  m  meaning  selection  that  are 
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analogous  to  those  that  i  have  described  for  the  speech  oerceotion  simulations 
Kawamoto  s  model  used  distributed  patterns  of  activation  over  an  ensemble  of  units  to 
represent  the  alternative  readings  of  an  ambiguous  word,  instead  of  the  local 

representations  that  have  been  used  m  the  interactive  activation  models  of  visual  word 
perception  and  speech  perception  In  spite  of  this  difference,  his  model  produces  the 
same  kinds  of  effects  that  we  have  seen  m  other  interactive  activation  models^ 

I  have  argued  that  the  results  we  have  reviewed  are  consistent  with  the  interactive 
approach  but  I  do  not  mean  to  suggest  they  cannot  be  accounted  for  within  an 

autonomy  position  One  possible  account  for  early  context  effects  is  to  suggest  that 
priming  can  occur  within  the  lexical  access  mechanism  itself  Indeed.  Burgess 
Seide'iperg  and  Tanennaus  (1986)  accounted  for  the  initial  selective  access  effects  that 
were  found  m  two  of  their  experiments  m  terms  of  such  effects  Intra-lexlcal  priming 
might  also  be  cited  as  a  possible  source  of  the  advantage  for  contextually  appropriate 
readings  in  other  studies  Unfortunately,  the  case  for  this  is  far  from  clear  at  this  point 
No  definitive  studies  have  been  done  showing  that  contextual  effects  only  result  from 
intraiexicai  factors,  controlling  for  degree  of  constraint  It  would  seem  that  it  behooves 
researchers  on  both  sides  of  this  debate  to  find  ways  of  separating  degree  of  constraint 
from  intra-  vs  mter-ievel  source. 

An  autonomy  account  can  also  be  salvaged  if  it  is  assumed  that  the  observed 
priming  effects  reflect  the  results  of  post-access  processes  Thus  as  l  stated  at  the 

outset,  the  finding  that  there  are  effects  of  context  on  responses  to  early  probes  is  not 

compelling  evidence  against  an  autonomy  account  My  purpose  has  only  been  to  show 
that  the  facts  that  have  emerged  from  these  cross-modal  priming  studies  do  not  speak 
against  an  interactive  position. 

Let  me  note  in  closing  that  there  are  tests  that  can  be  done  to  test  ihe  intcfaci've 
account  A  strong  test  would  be  to  examine  whether  context  influences  the  activation  of 
the  meanings  of  an  ambiguous  word,  even  under  conditions  where  it  is  strong  enough  lo 


‘/ou'd  'lire  fo  ac^f^O'vfecJge  c^n^rfbutions  Aia^  Kas^'amo^o  s  •Of''  to  this  cart  ot  thts  ;^rf'rip 

“^i^  simulations  and  his  revie'v  of  the  literature  that  served  as  the  bas's  for  tht$  discuss*on  of  'e^'oai 
resolution 
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allow  sudiects  to  guess  the  identity  of  the  ambiguous  word  Quickly  and  correctly  from 
the  contextual  information  alone  In  such  a  case  interactive  activation  predicts  that  the 
inappropriate  meaning  will  be  less  active  at  the  earliest  point  that  shows  activation  for 
either  meaning 

Autonomy  of  Syntax 

Recently  Lynne  Frazier  and  her  associates  have  proposed  that  syntactic  processing 
is  autonomous,  in  Frazier  (1986).  the  suggestion  is  made  that  the  syntactic  processor 
initially  makes  decisions  m  terms  of  a  very  general  principle  known  as  minimal 
attachment  and  provides  a  single  parse  fo  a  "thematic  processor"  for  acceptance  or 
reiecf'on  Here  I  am  not  so  much  concerned  with  the  specific  principle  of  minimal 
attachment  per  se.  as  with  the  more  general  claim  that  initial  parsing  decisions  are 
unaffected  by  constraints  arising  from  semantic/themaiic  considerations'  l  will  consider 
two  experiments  that  have  been  taken  as  evidence  for  the  autonomy  position,  both 
reported  in  Rayner.  Carlson,  and  Frazier  (1983)  The  first  shows  that  plausibility  based 
on  knowledge  of  real-world  constraints  has  little  or  no  effect  on  the  initial  processing  of 
reduced  relative  clauses  attached  to  sentence  initial  noun  phrases.  The  second  shows  a 
reading-time  advantage  for  sentences  containing  a  prepositional  phrase  that  is  minimally 
attached,  compared  to  matched  sentences  in  which  the  ultimate  interpretation  requires 
non-minimai  attachment  I  will  discuss  these  in  turn  dealing  with  the  first  one  rather 
more  briefly 

Reduced  relatives  In  Rayner  et  al.'s  first  experiment,  subjects  read  reduced 
relative  sentences  like  the  following 

(la)  The  florist  sent  the  flowers  was  very  pleased 
Such  sentences,  of  course,  have  been  well-studied  since  the  early  work  of  Sever  (1970) 
who  used  them  to  support  his  argument  for  a  particular  sentence  processing  strategy  he 
called  the  "NVN"  strategy.  According  to  the  NVN  strategy,  a  sequence  that  can  be 

s  ^ 

'!  do  not  nean  to  take  a  oarticuiar  stand  on  the  eoc  onaMctenzai'on  of  the  higner-ievei  laco'S  mai 
can  be  bfougbt  to  bear  on  syntactic  orocessmg^  by  seman-'c/tt'emaiic  constraints  (henceforth,  smioiv  rai'ed 
semantic)  i  mean  to  include  a  range  oi  constraints  that  arise  from  our  knoviedge  o(  the  meanings  of  -'crds 
and  of  the  -javs  me  entities  thev  refer  to  might  oiausibiv  be  interrelated  m  the  situations  that  .-e  descnbe 
■n  sentences 
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interpreted  as  noun-verb-noun,  that  is  not  otherwise  marked  as  subordinate,  is  taken  to 
specify  an  actor-action-object  sequence  Phrases  like  "The  florist  sent  the  flowers' 

engage  this  strategy,  and  so  lead  to  a  garden-path  effect,  causing  the  subject  to  slow 
down  and/or  back  up  when  information  inconsistent  with  this  effect  is  encountered 

That  this  NVN  strategy  is  very  potent  in  English  is  indicated  by  the  fact  that  it  is 
strong  enough  to  completely  over-ride  semantic/thematic  constraints  For  example  adult 
English  speakers  asked  to  act  out  the  sentence.  "The  pencil  kicked  the  cow"  will  pick 
up  the  pencil  and  knock  over  the  cow  with  it.  even  though  pencils  are  inanimate  and 
therefore  cannot  ordinarily  kick  (Bates.  ^ytcNew.  MacWhinney.  Devescovi.  and  Smith 
i982i  Apparently  the  NVN  strategy  is  strong  enough  to  override  semantic  constraints  in 
English 

It  IS  important  to  my  argument  to  note  that  m  other  languages  syntactic 

constraints  need  not  be  so  over-riding  For  example  in  Italian,  there  is  a  tendency  to 
use  the  actor-action-object  strategy  in  interpreting  N-V-N  sequences,  but  this  tendency  is 
not  over-riding  for  Italians.  Accordingfy.  Italians  interpret  analogs  of  "the  pencil  kicked 
the  cow"  in  accordance  with  semantic  constraints,  even  though  they  tend  to  treat  the 
first  noun  as  agent  in  more  neutral  sentences  such  as  "The  horse  kicked  the  cow" 
(Bates  et  ai  i982) 

The  point  so  far  is  that  syntactic  cues  vary  in  strength  from  language  to 
language,  and  there  is  no  universal  prepotency  of  syntax  over  semantics  it  just  so 
happens  m  English  that  there  is  a  very  strong  tendency  to  treat  NVN  as  actor  agent 

object.  In  English,  this  particular  syntactic  cue  is  strong  enough  to  override  semantic 

constraints  such  as  animacy  constraints  on  the  agents  of  action  verbs,  as  Bates  et  ai 
have  shown 

In  their  Experiment  1.  Rayner  et  a)  compare  reading  times  for  reduced  rpiniivp 
sentences  like  da)  in  which  the  NVN  =  actor-action-object  reading  of  the  beginning  of 
the  sentence  seems  very  plausible  with  other  sentences  in  which  such  a  reading  seems 
somewhat  less  biausibie  such  as  dbi 

db)  The  performer  sent  the  flowers  was  greatly  pleased 
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Although  oerformers  can  send  (lowers  they  are  less  likely  to  do  so  than  florists  Thus 
one  might  reason  if  subjects  were  able  to  make  use  of  semantic  constraints  m  on-iine 
syntactic  processing  decisions,  then  they  should  not  be  as  strongly  misled  m  sentences 
like  Hb)  However.  Rayner  et  a(.  found  that  subjects  were  slow  to  process  the 
disambiguating  portion  of  the  sentences  (in  this  case,  "was  greatly  pleased")  regardless 
of  the  plausibility  of  the  actor-acticn-object  interpretation  of  the  first  NVN  sequence 
indicating  that  they  were  led  down  the  garden  path  in  both  cases  Similar  null  effects 
of  animacy  of  the  sentence-mitial  noun-phrase  or  of  preceding  context  have  been 
reported  by  Ferreira  and  Clifton  (1986) 

Though  the  consistent  lack  of  an  effect  in  these  cases  might  seem  compelling  at 
'irsf  sight  it  IS  important  to  realize  that  it  does  not  necessarily  mean  that  syntactic 
processing  decisions  are  unaffected  by  plausibility  factors  m  all  cases  We  have  reason 
!0  believe  from  other  research  that  word  order  is  very  powerful  as  a  cue  in  English  and 
that  the  NVN  sequence  is  a  compelling  cue  for  an  Agent-Action-Object  interpretation  in 
contrast  the  plausibility  manipulation  used  by  Rayner  et  ai  seems  rather  weak  for 
example  there  is  no  reason  to  suppose  that  a  performer  could  not  send  flowers  say  to 
a  rival  at  the  opening  of  a  new  show  My  argument,  quite  simply,  is  that  we  cannot 
cut  weak  cues  against  strong  cues  and  expect  that  the  weak  cues  will  produce  strong 
effects  indeed  we  nave  seen  how  strong  cues  can  completely  override  weaker  ones  m 
one  of  our  initial  illustrative  simulations.  We  have  independent  evidence  that 

demonstrates  the  potency  of  the  NVN  strategy,  and  so  vve  cannot  be  surprised  to  find 
that  weak  contextual  constraints  have  no  reliable  effects  The  interactive  activation 
framework  makes  dear  that  if  we  wish  to  find  effects  of  a  particular  factor  we  must 
look  at  situations  in  which  there  are  no  other  factors  exerting  overpowering  effects 

Prepositional  phrase  attachment  Just  such  a  situation  is  provided  by 
attachment  ambiguities  such  as  the  one  that  arises  in  sentences  like  The  bov  hit  the 
girl  with  the  doll"  In  comprehending  such  sentences,  the  reader  must  decide  whether 
to  treat  "the  doll"  as  the  instrument  of  hitting,  thereby  attaching  it  to  the  verb  phrase 
or  whether  to  treat  it  as  an  object  m  the  girl  s  possession  thereby  attaching  it  as 
constituent  of  a  complex  noun-phrase  headed  by  'the  girl" 
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Suc^  decisions  are  clearly  influenced  by  thematic  plausibility  constraints  Cons'aer 
for  example  the  following  sentences 

{2a)  The  spy  saw  the  cop  with  binoculars 
(2b)  The  spy  saw  the  cop  with  a  revolver 

In  the  former  sentence,  we  tend  to  treat  "binoculars"  as  an  instrument;  in  the  latter,  we 
treat  "revolver"  as  a  possession  of  the  cop  In  general,  it  appears  that  the  verb  and 
all  of  the  noun  phrases  influence  these  decisions  Compare  for  example. 

f3ai  The  spy  shot  the  cop  with  binoculars 

(3b)  The  spy  shot  the  cop  with  a  revolver 

and 

'Ja)  The  woodpecker  saw  the  bird-watcher  with  binoculars 

(4b)  The  bird-watcher  saw  the  woodpecker  with  binoculars 

Indeed  Oden  (1978)  has  shown  that  attachment  decisions  can  be  influenced  by  the 
identities  of  the  various  NPs  m  the  sentence  and  by  preceding  context. 

No  one  doubts  the  role  of  these  constraints  in  the  ultimate  interpretations  assigned 
to  sentences.  What  is  at  issue  is  whether  such  constraints  affect  the  initial  attachment 
decisions  subjects  make  in  the  course  of  reading  or  listening.  An  interactive  account 
would  assume  that  the  initial  attachment  decision  is  susceptible  to  influence  from 
semantic  constraints  m  view  of  the  fact  that  both  kinds  of  attachments  are  encountered 
frequently  there  vould  be  no  reason  to  suppose  that  there  would  be  a  strong  syntactic 
bias  in  favor  of  one  attachment  over  the  other  Frazier  however  has  pointed  out  that 
the  attachment  of  the  preposition  phrase  as  a  constituent  of  the  verb  phrase  would 
require  the  creation  of  no  extra  structure,  and  therefore  she  has  proposed  that  verb- 
phrase  (VP)  attachment  is  tried  first  by  the  syntactic  processor,  independent  of  semantic 
constraints 

The  second  experiment  reported  by  Rayner.  Carlson,  and  Frazier  (1983)  adrireqcc'ri 
this  claim  They  presented  subjects  with  sentences  like  (2a)  and  (2b)  above,  with  an 
extra  final  clause  added  and  measured  reading  time  as  in  their  first  experiment  They 
reasoned  that,  if  the  syntactic  processor  initially  prefers  VP  attachments,  then  reading 
times  should  be  slower  for  sentences  like  (2a).  where  a  VP  attachment  turns  out  to  be 
consistent  with  thematic  considerations  The  results  of  the  experiment  supported  this 
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ored'Ction  '■eading  times  were  somewhat  slower  on  and  after  the  disambiguating  word  m 
the  ve'^S'cns  of  the  sentences  where  the  ultimate  reading  favored  attachment  of  the 
oreoositionai  ohrase  to  the  preceding  noun-phrase  (NPt 

While  the  results  were  consistent  with  this  prediction,  it  turns  out  that  there  is  an 
alternative  account  It  is  possible  that  the  effects  observed  by  Rayner  et  at  are  not 
due  to  a  syntactic  preference  for  minimal  attachment,  but  to  the  fact  that,  in  Rayner  et 
al  s  materials  there  is  a  consistent  semantic  bias  in  favor  of  the  minimal  completion 
To  show  this  Taraban  and  McClelland  (in  preparation)  asked  subjects  to  read  Rayner  et 
al  s  sentences  through  the  preposition  at  the  beginning  of  the  critical  prepositional 
phrase  and  then  to  generate  an  expectation  for  the  completion  of  this  phrase  The 

suDiect  then  saw  either  the  VP  or  the  NP  completion,  and  was  asked  to  rate  how  well 

'he  actual  completion  matched  the  expectation  Subjects  rated  the  VP  completions 
Significantly  closer  to  their  expectations  on  average  than  the  NP  completions  (3  62  vs 
2  90  on  a  five-oomt  scale) 

To  determine  whether  it  was  this  greater  concordance  with  expectations  that  was 
determining  the  advantage  for  VP  over  NP  completions.  Taraban  and  McClelland 
constructed  20  additional  sentence  pairs  that  were  intended  to  produce  expectations 
favoring  an  NP  completion  An  example  is 

(5a)  I  read  the  article  m  the 

This  can  be  completed  with  a  word  like  "magazine"  in  which  case  the  PP  is  attached 
to  the  NP  or  with  a  word  like  "bathtub",  in  which  case  the  PP  is  interpreted  by  most 
subjects  as  being  attached  to  the  VP  The  completion  words  used  in  the  two  conditions 

were  matched  over  the  set  of  materials  for  both  length  and  frequency  As  intended  the 

NP  completions  of  Taraban  and  McClelland  s  sentences  were  rated  closer  to  subjects 
expectations  than  the  VP  completions  (3  90  vs.  2  98) 

Once  ratings  had  been  collected,  both  Rayner  et  al  s  sentences  and  Taraban  & 
McClelland  s  new  sentences  were  presented  to  another  group  of  subjects  in  a  word-by¬ 
word  reading  time  task.  At  the  beginning  of  each  trial  the  subject  pressed  a  button 
causing  the  presentation  of  a  row  of  dashes,  blanks,  and  punctuation  marks.  Each  dash 
indicated  the  presence  of  a  letter  in  the  to-be-read  sentence,  with  blanks  indicating  the 
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spaces  oe’.veen  words  The  next  press  of  the  button  caused  the  first  set  of  blanks  to 
be  replaced  with  the  first  word  of  the  sentence  Each  subsequent  press  of  the  button 
caused  the  next  word  to  be  presented  and  the  preceding  word  to  be  replaced  with 
blanks.  The  last  word  of  the  sentence  was  always  the  disambiguating  word.  When  the 
subject  pressed  the  button  after  reading  this  word  a  question  appeared  Subjects  were 

instructed  to  read  the  sentences  as  rapidly  as  possible  consistent  with  good 

comprehension,  and  the  answers  to  the  questions  were  recorded  by  the  experimenter 
Accuracy  was  very  high  and  did  not  differ  between  experimental  conditions  In  addition 
to  the  29  target  sentences  there  were  66  filler  sentences  Seven  of  these  were  used 
to  balance  the  frequency  of  NP  and  VP  attachments  of  sentence  final  prepositional 
phrases  The  remaining  59  were  fillers  of  many  different  types  included  to  vary  the 

materials  so  that  subjects  would  not  get  into  a  set  of  expecting  a  sentence-final 
prepositional  phrase 

The  reading  limes  for  the  final  words  of  the  sentences  are  shown  in  Figure  6a. 
broken  down  by  attachment  and  source. 

Two  things  are  apparent  from  the  results.  First,  with  Rayner  et  al  s  materials,  we 

were  able  to  replicate  their  effect  showing  faster  reading  times  for  VP  vs.  NP 

attachments  Second,  however  we  found  that  with  our  materials,  this  effect  was 

reversed,  and  reading  times  were  actually  shorter  for  NP  completions  than  for  VP 
attachments  There  was  no  main  effect  of  attachment  type,  but  there  was  a  highly 
reliable  interaction  of  completion  type  with  source  (RCF  vs  TM).  There  was  also  a 
mam  effect  of  source,  but  this  is  not  interpretable,  since  Taraban  and  McClelland  s 

completions  were  generally  shorter  and  more  frequent  than  those  used  by  Rayner  et  al 

It  has  often  been  suggested  that  the  lime  spent  reading  the  final  word  of  a 
sentence  reflects  extra,  integrative  processes  that  do  not  occur  at  other  points  Thus 
the  reading  times  Taraban  and  McClelland  observed  in  this  experiment  might  reflect  such 
integration  effects,  and  these  effects  might  be  masking  a  real  effect  of  attachment  that 
would  appear  if  it  had  not  been  overshadowed  by  such  sentence-final  integration  effects 
To  address  this  problem.  Taraban  and  McClelland  extended  the  sentences  For  the 
Rayner  et  al.  sentences  we  used  continuations  they  had  used,  and  for  our  own  we 

constructed  completions  of  the  same  kind  In  ail  cases,  the  continuation  began  with  a 
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Figure  6;  Oooosite  effects  of  attachment  on  reacting  time  for  target  words  triggering 

different  attachment  decisions,  for  sentences  of  Rayner  et  a(.  (1983)  iRCFi 

and  Taraban  and  McClelland  (TM)  in  the  first  Experiment.  (a)  me 

sentence  ended  with  the  target  word  and  the  reading  times  shown  are  for 
mis  word  only  In  the  second  experiment  ib'  the  sentence  continued  on 

bevond  the  target  word  and  reading  times  are  based  on  the  sum  of  me 
time  spent  reading  the  target  word  and  the  three  following  words 

coniuncfion  that  dearly  indicated  the  beginning  of  a  new  clause,  such  as  while  or 

because ' 


Figure  6b  shows  the  total  reading  time  for  the  target  word  and  the  following  three 
words  broken  down  by  VP  vs  NP  attachment  and  source  Once  again  there  was  no 
mam  effect  of  attachment,  but  there  was  a  strong  attachment  by  source  mieran'i^n 
i^maliv  i^igure  7  shov;s  the  difference  m  reading  times  between  the  VP  and  NP 
completions  of  the  sentences,  on  a  word-by  word  basis,  starting  with  the  disambiguating 
word 

The  figure  indicates  that  there  is  no  effect  of  condition  on  me  reading  time  for  me 
disambiguating  word  itself  However  there  is  an  effect  m  each  of  the  next  two  words, 
by  the  third  word  after  the  disambiguation  the  difference  appears  to  have  disappeared 
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Figure  7;  The  time-course  of  the  processing  difference  between  NP  and  VP 
attachment  versions  of  the  flayner  et  at,  (RCF)  and  Taraban  and  McClelland 
(■nW)  sentences  Times  shown  are  reading  times  for  words  m  the  NP- 
attachement  version  minus  reading  times  for  words  m  the  VP-attachment 
■  ersion  for  the  target  word  and  each  of  the  three  following  words 

f  would  aocear  'ram  this  analysis  that  processing  that  occurred  on  the  disambiguating 
word  when  -t  was  the  last  word  of  the  sentence  is  being  spread  out  over  subseauent 
words  m  this  case  As  before,  there  is  no  evidence  that  this  extra  processing  reflects  a 
disruption  that  occurs  with  non-minimal  completions  m  general  Rather  it  apoears  that 
the  extra  processing  occurs  for  minimaf  or  non-mmimal  completions,  depending  on 
whether  the  VP  or  NP  completion  is  closer  to  the  sub|ects  expectations 

Once  again  i  do  not  intend  to  suggest  that  the  facts  actually  rule  out  'ho 
autonomous  syntax  oosifion  m  favor  of  an  interactive  view,  it  remains  possible  to  suppose 
that  syntactic  processing  is  autonomous,  but  that  what  is  determining  the  reading  times 
we  are  observing  s  not  (or  is  not  simply)  the  output  of  this  Syntacnc  process  On  '^e 
other  hand  the  interactive  activation  approach  deserves  some  credit  for  giving  us 
guidance  m  the  search  for  cases  m  which  processing  times  appear  to  be  dominated  bv 
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semantic  as  opposed  to  syntactic  considerations  At  the  very  least  it  seems  clear  that 
Rayner  et  ai  s  second  experiment  provides  little  reason  to  doubt  that  semantic 
considerations  can  play  a  role  in  syntactic  decisions,  given  the  fact  that  it  appears  to  be 
semantic  and  not  syntactic  factors  that  are  controlling  reading  times  for  these  sentences 


In  summary.  I  would  suggest  that  the  findings  of  Rayner  et  al  need  not  be 
interpreted  as  favoring  any  version  of  autonomous  syntax  hypothesis  Though  syntactic 
cues  are  sometimes  so  strong  that  they  overshadow  semantic  constraints,  we  find  that 
under  other  conditions  semantic  constraints  do  appear  to  exert  relatively  immediate 
effects 


Distinguishing  Interactive  from 
Autonomous  Processing 

Although  some  quibbling  may  be  possible,  the  evidence  appears  to  me  to  be  fairly 
clear  in  supporting  the  following  proposition. 

Decisions  about  representational  units  of  all  kinds  involve  the  consideration  of 
multiple  sources  of  information. 

However,  this  can  be  seen  simply  as  a  restatement  of  some  of  the  basic  findings, 
rather  than  as  a  statement  about  whether  the  processing  system  j  inherently  interactive 
or  not.  To  see  this.  I  will  briefly  consider  two  cases:  The  lexical  effect  on  phoneme 
identification  (Ganong.  1980)  and  the  role  of  semantic  context  in  resolving  the  attachment 
ambiguities  we  have  been  discussing.  In  both  cases,  we  might  account  for  the  results 
with  a  purely  bottom-up  processing  system,  in  which  each  module  operates  completely 
independently  of  influences  from  higher  levels  of  processing.  Thus  in  Ganong  s  case, 
one  may  propose  that  the  phoneme  level  passes  to  the  word  level  activations  indicating 


^The  tact  that  we  used  a  '"'ord-bv-worcJ  reading  time  measure,  ccuoted  ’"th  the  >aci  (hat  our  effects  cniv 
show  uD  on  the  "'Ord  after  the  disambiguating  -'ord.  might  he  tahen  as  e"'dence  that  m  tact  the  eftec's  "e 
observed  occur  after  an  initial  syntactic  attachment  process  that  ■•'orirs  immediately  and  is  'eiiecied  ciiv  m 
eve  tixation  duration,  in  this  context  it  should  be  noted  that  Ravner  et  ai.’s  findings  did  not  shov  lid 
clearly  in  fixations  on  the  target  word:  indeed  the  statistical  evidence  for  their  effect  was  somewhat  weak  m 
their  eve-movement  data,  perhaps  because  subjects  tend  to  overlap  the  completion  ot  higher  levels  of 
processing  vith  the  intake  of  subsepuent  words 
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.vhich  Dhonemes  are  consistent  with  the  input  and  to  what  extent  and  that  the  word 
level  uses  these  graded  activations,  in  conjunction  with  lexical  constraints,  to  determine 
which  wordist  are  consistent  with  the  input.  Thus  if  a  phoneme  ambiguous  between  /g/ 
and  /k/  is  heard,  the  phoneme  level  may  pass  on  the  ambiguity  to  the  word  level 
Ganong  s  finding  could  simply  result  from  choosing  as  an  overt  response  the  phoneme 
that  is  most  consistent  with  the  word  that  the  subject  has  heard.  The  decision  is  still 
based  on  information  from  multiple  sources,  but  this  integration  of  information  does  not 
occur  at  the  phoneme  level  of  processing  within  the  perceptual  system:  instead,  it  occurs 
in  some  later  decision-making  process  that  can  consult  the  final  output  of  the  word  level 

In  the  sentence  processing  case,  the  situation  is  analogous  One  could  suppose 
'hat  the  syntactic  processing  mechanisms  operate  autonomously,  passing  on  to  higher 
levels  the  output  of  a  preliminary  syntactic  analysis  In  the  case  of  attachment 

ambiguities  such  as  those  considered  here,  one  might  assume  (contrary  to  Frazier,  but 
more  or  less  consistent  with  the  recent  view  of  Marcus.  Hindle  and  Fleck.  1983)  that  the 
output  reflects  the  possible  attachments  that  are  consistent  with  the  syntax,  with  each 
activated  to  a  degree  that  reflects  its  relative  likelihood  based  on  syntactic 

considerations.  The  semantic  processor  could  then  make  use  of  this  inform, ation.  in 

conjunction  with  semantic  constraints,  to  achieve  an  interpretation  that  was  jointly 
constrained  by  syntactic  and  semantic  factors. 

This  purely  bottom-up  story  has  many  of  the  same  implications  as  an  interactive 
account,  since  it  explains  how  influences  from  all  levels  can  have  effects  on  the  final 

outcome  of  processing.  It  is  certainly  consistent  with  a  large  number  of  existing 

experiments  on  contextual  influences.  One  might  ask.  then,  whether  there  is  any  way  of 
distinguishing  this  purely  botfom-up  account  from  an  interactive  view. 

Fodor  (1983)  has  made  one  suggestion  He  has  observed  that  to  counter 
unidirectional  accounts,  if  is  necessary  to  show  "that  the  information  fed  back  interacts 
with  interlevels  of  input-processing  and  not  merely  the  final  results  of  such  processing  ’’ 
Thus,  for  example,  if  one  could  show  that  the  results  of  semantic  processing  are  fed 
back  into  the  syntactic  processor  in  such  a  way  as  to  influence  subsequent  syntactic 
processing  decisions,  or  that  the  results  of  lexical  processing  are  fed  back  into  the 
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Dhonetic  level  so  as  to  influence  subsequent  phonetic  processing  decisions,  then  one 
would  have  provided  evidence  that  processing  is  indeed  interactive 

To  illustrate  this  approach.  I  will  describe  a  recent  experiment  by  Elman  and 
McClelland  (submitted)  In  this  experiment,  we  relied  upon  the  fact  that  listeners 
compensate  for  coarticulatory  influences  of  one  speech  sound  on  the  acoustic  realization 
of  neighboring  sounds  in  the  case  we  exploited  the  phonemes  /s/  and  /S/®  alter  the 
acoustic  realization  of  a  subsequent  /t/  or  /k/:  listeners  compensate  for  this  coarticulation 
effect  by  adjusting  the  perceptual  boundary  between  /t/  and  /k/.  so  that  a  sound  that 
would  be  on  the  boundary  in  a  neutral  context  tends  to  be  heard  as  a  /k/  when  it 
occurs  just  after  a  is/  but  as  a  /t/  when  it  occurs  after  a  iS/  We  reasoned  as  follows 
First  we  assumed  that  this  coarticulatory  compensation  is  an  intrinsic  characteristic  of 
processing  at  the  phoneme  level  Given  this,  we  noted  that  it  should  be  possible  to  use 
lexical  constraints  to  get  subjects  to  interpret  a  sound  halfway  between  fsl  and  iS/  as  a 
IS/  in  one  context  and  as  a  iSl  in  another  Now  if.  as  we  assumed,  this  lexical  effect 
operates  by  feeding  back  activation  to  the  phoneme  level;  and  if.  as  we  also  assumed, 
interactions  at  the  phoneme  level  are  responsible  for  the  coarticulatory  compensation 
effect,  then  the  lexical  effect  on  the  ambiguous  isi-fSi  sound  should  trigger  a 
coarticulaiorv  compensation  effect  that  influences  the  phonetic  interpretation  of  an 
ambiguous  /k/-/t/  sound  On  the  other  hand,  if  Ganong  s  effect  operates  only  on  the 
final  results  of  phonetic  processing,  and  does  not  feed  back  anything  to  the  phonetic 
level,  then  we  would  expect  no  coarticulatory  compensation  as  a  result  of  the  lexical 
effect 


We  therefore  took  pairs  of  words  (e  g  .  "tapes/capes")  distinguished  by  initial  /t/  vs. 
/k/  (or  /d/  vs.  /g/.  which  exhibit  the  same  effects  of  preceding  isi  and  iSi)  and 
constructed  from  recorded  tokens  of  these  words  a  set  of  seven  stimuli  beginning  with 
sounds  varying  between  l\l  and  /k/  in  small  steps  Each  of  these  stimuli  was  preceded 
by  one  of  two  context  words.  In  one  experiment,  one  word  (eg.,  ’’foolish")  actually 
ended  in  /$/  and  the  other  (eg..  "Christmas'')  actually  ended  in  /$/  In  another 
experiment,  the  same  context  words  were  used  but  the  final  segments  were  replaced  by 
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an  amtiguous  sound  that  was  determined  m  pre-testing  to  fall  halfway  between  /s/  and 
'S/  here  designated  as  i'>i. 

The  first  experiment  simply  replicated  the  coarticulaiory  influence  of  isi  and  iSi  on 
the  Identification  of  borderline  /t/-/k/  stimuli,  as  previously  described  by  Mann  and  Repp 
(1982);  as  expected,  words  ending  in  /s/  tended  to  lead  to  an  increased  probability  of 
/k/  responses  to  the  subsequent  ni-ikl  stimulus,  while  the  words  ending  in  ISI  tended  to 
lead  to  an  increased  probability  of  /t/  responses. 

The  second  experiment  provided  the  crucial  test  for  the  interaction  hypothesis 
Here  we  found  that  prior  context  did  indeed  trigger  coarticulatory  compensation  for  the 
lexically-determined  -'s/  or  iS/  phoneme  for  example,  subjects  reported  /k/  more  often 
after  Christmas"  than  after  ’fooli'^"  just  as  predicted  The  results  for  several 
context/target  sets  involving  /t/-/k/  and  /d/-/g/  identification  are  shown  in  Figure  8 

The  results  of  this  experiment  demonstrate  that  lexical  influences  on  phoneme 
identification  can  induce  coarticulatory  compensation,  as  predicted  from  the  interaction 
hypothesis.  This  is  exactly  what  we  would  expect  if.  indeed,  feedback  from  the  lexical 
level  actually  does  influence  processing  at  the  phoneme  level,  rather  than  simply 
influencing  the  interpretation  of  the  outcome  of  such  processing.  More  importantly,  the 
experiment  demonstrates  a  method  that  I  think  holds  some  considerable  promise  of 
providing  a  way  of  determining  the  extent  of  interaction  in  perceptual  and  linguistic 
processing. 

It  remains  possible  to  salvage  a  bottom-up  account  for  these  findings,  but  I  do  not 
think  this  is  a  very  attractive  option.  To  do  so.  one  must  suppose  that  compensation 
for  coarticu/ation  is  accomplished  by  the  same  "late”  mechanism  that  uses  lexical 
information  to  make  decisions  about  the  identity  of  phonemes.  This  seems  an 
unattractive  suggestion,  because  compensation  for  coarticulation  is  so  often  taken  as  an 
intrinsic  and  basic  function  of  the  mechanisms  of  phoneme  perception  (see.  for  example 
Liberman.  Cooper.  Shankweiler.  and  Studdert-Kennedy.  1967).  To  ascribe  this  function  to 
some  "later"  level  would  be  to  deprive  the  machinery  of  phoneme  perception  of  one  of 
its  most  crucial  roles:  or  to  needlessly  duplicate  the  intricate  knowledge  of  coarticulatory 
influences  that  is  assumed  to  be  present  in  the  mechanisms  of  phoneme  perception  in 
mechanisms  of  post-perceptual  judgement. 
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Figure  8: 


Identification  curves  for  three  sets  of  experimental  stimuli  used  by  Elman 
and  McClelland  (1986).  The  left  panels  show  the  effects  of  acoustically 
distinct  's"  and  "sh"  sounds  on  /t/-/k/  and  idi-igi  judgements;  the  right 
panels  show  the  effects  of  acoustically  indentical  (lexically  disambiguatedi 
sounds  halfway  between  "s"  and  "sh"  (represented  by  The  label  above 
each  panel  indicates  the  words  that  were  used  to  bracket  the  ambiguous 
't/-/k/  and  lai  ■  /g/  stimuli;  the  labels  associated  with  each  curve  inhir^m 
the  preceding  context  for  the  judgement  percentages  (percentage  'g'  or  k/ 
(udgements.  depending  on  the  continuum)  indicated  by  the  corresponding 
curve 


More  generally,  it  would  always  be  possible  to  say  that  processing  interactions  that 
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are  assumed  to  result  from  intra-level  influences  were  actually  occurring  at  a  higher  level 
and  thereby  to  sidestep  any  possible  applications  o(  Fodor  s  suggested  test  But  this 
step  is  only  palatable,  it  seems  to  me.  if  the  higher-level  decision  can  be  made  using 
information  that  would  ordinarily  be  assumed  to  be  available  to  the  higher  level  Thus, 
its  seems  quite  sensible  to  suppose  that  phonetic  ambiguity  could  be  passed  up  to  a 
later  stage  for  resolution  at  the  word  level  provided  the  word  level  does  it  by  using 
lexical  constraints  But  if  the  word  level  must  use  the  very  sorts  of  information  usually 
attributed  to  the  phoneme  level,  then  the  entire  notion  of  encapsulation  of  knowledge  is 
undermined. 

This  discussion  brings  up  another  point,  and  that  is.  why  bother  with  feedback^ 
What  s  the  good  of  if’  Why  should  it  matter  if  higher  levels  feed  back  information  into 
lower  levels’’  Why  should  they  not  simply  resolve  the  ambiguities  that  are  passed  on  to 
them  whenever  they  can.  and  forget  about  providing  feedback  supporting  one  alternative 
over  the  other’ 

The  good  of  feedback  is  that  it  permits  processing  on  lower  levels  to  be  guided 
from  above,  thereby  allowing  them  to  provide  higher  levels  with  better  information.  Our 
coarticulation  study  gives  one  example  of  this,  if  higher  levels  can  help  lower  levels 
decide  on  the  identity  of  phonemes  that  are  perceptually  indistinct,  then  lower  levels  can 
use  this  information  to  adjust  for  coarticulation  better  than  they  could  otherwise 
Similarly,  at  the  syntactic  level,  if  higher  levels  can  influence  the  formation  of  syntactic 
representations  of  one  constituent,  they  will  allow  the  syntactic  level  to  be  better 
prepared  to  provide  the  best  analysis  of  what  will  come  later  on  in  the  sentence  in 
both  cases,  this  allows  the  lower  level  to  do  a  better  job  in  providing  information  to  the 
higher  level. 


Summary 

In  the  preceding  sections  of  this  paper.  I  have  described  a  framework  for  modeling 
the  process  of  forming  representations  in  processing  written  and  spoken  language.  I 
have  shown  how  this  framework  can  help  us  understand  why  contextual  effects  may  be 
obtained  under  some  circumstances  and  not  others,  and  why  it  often  appears  to  exert 
selective,  as  opposed  to  predictive  effects 
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In  the  course  of  making  these  observations.  I  have  argued  that  some  of  the 
evidence  that  has  been  taken  in  support  of  the  idea  that  lexical  access  and  syntactic 
processing  are  invulnerable  to  external  influences  is  fully  consistent  with  an  interactive 
account  I  do  not  say  that  this  part  of  the  analysis  proves  that  the  autonomy  position  is 
wrong,  only  that  several  of  the  reasons  that  have  been  given  for  believing  it  that  it  is 
wrong  are  far  from  compelling 

Finally  i  have  indicated  that  there  is  hope  of  finding  empirical  evidence  relevant  to 
distinguishing  between  interactive  and  feed-forward  accounts  of  information  processing: 
Such  evidence  takes  the  form  of  demonstrations  that  higher  levels  of  processing  can 
trigger  processes  at  lower  levels,  increasing  the  quality  of  the  results  they  pass  on  later 
to  higher  levels 

It  remains  to  build  explicit  models  of  interactive  processing  at  higher  levels  Of 
course  this  is  a  difficult  task  for  any  processing  framework:  certainly  no  adequate  model 
of  the  formation  of  a  representation  of  the  event  or  scene  described  by  a  sentence  has 
been  proposed  to  date.  From  what  we  know  about  the  susceptibility  of  higher  levels  of 
language  processing  to  contextual  information  (c.f..  Bransford  and  Johnson.  1973),  it 
seems  fairly  clear  to  me  that  any  adequate  model  will  have  to  incorporate  the  principles 
of  interactive  activation  What  is  not  clear  at  this  point  is  how  these  principles  will  need 
to  be  elaborated  and  supplemented  to  capture  the  structural  complexifies  that  arise  at 
higher  levels  This  remains  a  central  issue  for  future  research 
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